Abstract

It is widely believed that the particular wiring observed within cortical columns boosts neural
computation. We use rewiring of neural networks performing real-world cognitive tasks to study the
validity of this argument. In a vast survey of wirings within the column we detect, however, no traces
of the proposed effect. It is on the mesoscopic inter-columnar scale that the existence of columns,
largely irrespective of their inner organization, enhances the speed of information transfer and
minimizes the total wiring length required to bind the distributed columnar computations towards
spatio-temporally coherent results.
Author summary

Cortical columns with their wiring substructures are a biological fact known for more than
one hundred years. Yet their function and computational role have largely remained unexplained.
Recent computational work has put forward that this organization provides a direct computational
advantage. A corresponding conclusion was drawn from an artificial computational context using
detailed simulations of neurons and their interactions, but to date this finding has not been
validated in the context of real-world applications. While covering an extremely wide range of
network configurations, neuron and network models and cognitive tasks, we find no sign of a
computational boost triggered by the inner-columnar template. We provide an alternative explanation:
Doubly fractal connectivity laws with exponents as found in the cortex will optimize the speed of
signal transfer in the cortex using minimal wiring length. While such laws do not necessarily
imply cortical columns, a self-similarity of columnar structures would be a simple way to implement
this connectivity.

INTRODUCTION 

Towards the turn of the 19th century, J. P. Müller, E. du Bois-Reymond and H. von Helmholtz
[1] discovered that neurons are electrically excitable and that this predictably affects the electrical
state of connected neurons. Shortly after, Golgi and Ramón y Cajal [2] provided their Nobel-prize
winning description of neuronal and cortical architecture, revealing in the case of the human
neocortex striking columnar structures divided into six layers. Ever since this discovery it has
remained a question to what extent neuronal physics and the cortical architecture could account
for the exquisite properties of the human brain, at least within the scope limited by Gödel's theorem
[3]. Recent attempts at solving this problem concentrate on physically building the brain, using
electrical neurons organized according to the cortical blueprint (e.g., [4]). Motivated by typical
construction constraints encountered in chip making, we explore three potential benefits of the
cortical wiring template. On the inner-columnar scale we measure the effect that the columnar
wiring has on real-world pattern recognition tasks in a framework [5, 6] that permits measuring
recognition rates without the learning process compromising the columnar wiring. On the
inter-columnar scale, the effects on the speed of information transport and computation are analyzed
in a framework [7, 8] that offers analytical methods with results valid for sufficiently general
situations.

Our approach is to start from networks in which details of cortical architecture are implemented.
Using various rewiring processes, we move away from the biological blueprint, measuring the
effects of the removal. On the columnar level, our work parallels some investigations performed
in Ref. [9] in a more abstract setting, where a beneficial influence of microcircuit structures on
computation was reported.

BIOLOGICAL DATA, COLUMN MODELS, MODELS OF NETWORKS OF COLUMNS 

Biological data: We use the biological data collected by Roerig et al. [10] for a similar context.
A log-log adaptation of their original data (see Fig. 1) reveals a break between two decay
laws at the scale indicated in Fig. 1 by the dashed vertical lines. These lines mark roughly the
extension of a (physiological) cortical column. Roerig et al. [10] also noted that 'A small fraction
of inputs originated more than one mm away'. While the displayed data might suggest two
power laws, across the columnar scale we will work with an exponentially decaying connectivity
probability. The difference to a power-law decay would here be small, and the exponential decay
allows the direct comparison of our results to similar work performed in Ref. [9]. An extremely
broad model survey will demonstrate that inner-columnar wiring following biological templates
has no computational advantage. At inter-columnar scales, i.e. in networks taking whole columns
as the nodes, a power-law decay will be implemented and compared to other connectivities.
Power-law connectivity will be found to be most beneficial if, beyond the data range covered by
Roerig et al., this power-law decay gives way to a milder power law, expressing the observation
that over very long distances the connection probability should not go to zero too quickly.
Motivated by an approximate self-similarity over the microcolumn-column-hypercolumn scales, we
will assume that the exponent associated with the slow decay is close to the one estimated
across the columnar distance. Our results do not, however, depend critically on the exact values
of the exponents; only their relative ordering is relevant. Our implementation leads to a
network that optimizes information transfer at minimal wiring cost.

Columnar computational nodes: Despite one hundred years of intense investigations, a precise
correspondence between functional and physiological columns has not been obtained [11].
Nonetheless, the wide-spread conception is that cortical computation essentially takes place within
a column. To test the effect of wiring structure on computation, we therefore first measure to what
extent inner-columnar wiring contributes towards computation. We implemented two levels of
cortex-inspired wiring structure (cf. Fig. 2). Upon gradually eliminating architectural details by
randomly rewiring the connections, we will measure the impact that connectivity details have
on cognition and computation. For a network realization, only connection probabilities shall be
prescribed, contrary to what the term 'cortical microcircuit' used in [9] might evoke. Throughout
all experiments, the abundance of inhibitory neurons within the population of neurons was kept at
20 percent.

FIG. 1: Biological data: Logarithmic density of photostimulation-evoked excitatory (a) / inhibitory (b)
synaptic inputs in concentric rings spaced 50 µm apart, from 19 pooled layer 2/3 neurons (adapted from
Ref. [10]). A small fraction of inputs originated more than one mm away. The vertical dashed lines mark
the typical extension of a (physiological) column. While the data might suggest the presence of two
power laws, we will computationally implement power-law connectivity only across the intermediate
scale. On the columnar scale, we will work with an exponential decay. Beyond the scales represented by
the data, we propose the existence of a power law of slower decay (tilted dashed lines) ensuring the
existence of a minimal amount of connections across large scales.

FIG. 2: I) a) EI model, b) EI control network (uniform synaptic weights w, λ = 2). p_con: probability of
a synaptic connection among neurons of distance d for C-values as in the text, w: synaptic strength of the
connections. II) a) LEI model, b) LEI control network. The input vectors displayed at the right-hand side
of the matrices show how much input the respective populations receive.

In the simple excitatory-inhibitory EI network model, the biological architecture is reduced
to an excitatory and an inhibitory neuronal population and the connections within and between
them. To vary the network structure within the EI model frame, we use a parameter λ ruling the
probability for a connection from neuron j to neuron i according to

$$P_{\mathrm{con}}(i,j) = C(i,j)\cdot\exp\!\left(-\left(\frac{d_{ij}}{\lambda}\right)^{2}\right), \tag{1}$$

where

$$d_{ij} = |x_i - x_j|$$

is the Euclidean distance between the i'th and the j'th neurons' positions in the neural network (see
end of paragraph). As λ controls both the number and the typical length of the connections, varying
from unconnectedness (λ = 0) over local next-neighbor connectivity (λ = 1) to global connectivity
(λ = ∞), we will use this parameter to scan different network structures. C(i, j) establishes the
connectivity among excitatory ('E') and inhibitory ('I') neurons by means of one
pooled synapse. Our choice C(E,E) = 0.3, C(E,I) = 0.4, C(I,E) = 0.2 and C(I,I) = 0.1
reflects the typical biological connectivity. If a connection is made, the synaptic weights are
drawn from a uniform distribution over [0, 1], multiplied by the weight factors w(E,E) = 30,
w(E,I) = -19, w(I,E) = 60 and w(I,I) = -19. The model is compared to a control network
where C is uniformly set to 0.3 and the synaptic weights are again drawn from a uniform distribution
over [0, 1], but multiplied by a uniform weight factor scaled to match the total weight of the
non-control networks and endowed with a sign to distinguish between inhibition and excitation. With
the help of λ we can assess a huge range of neural network architectures. There is, however, one
important issue that we need to take care of additionally. Whereas in classical neural network
theory the role of hidden neurons is crucial, in the classical liquid state network paradigm that
we will use below the input is relayed to every reservoir neuron, which renders all neurons
non-hidden. To compensate for this shortcoming, we will also vary the fraction f of input-receiving
reservoir neurons. To simplify the comparison of results, we will mainly discuss results obtained
for EI networks chosen as in Ref. [13] on a three-dimensional grid of 3 × 3 × 15 = 135 neurons.
We checked, however, that the obtained results also hold for larger networks, and report where
this was not the case.
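
The EI construction described above can be sketched in a few lines of Python. This is an illustrative sketch only, not the code used for the reported experiments. It assumes a squared-exponential distance dependence exp(-(d_ij/λ)²) as is common in the liquid state literature, reads the first argument of C and w as the postsynaptic population, places the 20 percent inhibitory neurons at random grid positions, and excludes self-connections. The names `ei_network`, `C` and `W_FACTOR` are hypothetical.

```python
import numpy as np

# C[post, pre] and weight factors w[post, pre] from the text; index 0 = 'E', 1 = 'I'.
# Reading the first index as the postsynaptic population is an assumption.
C = np.array([[0.3, 0.4], [0.2, 0.1]])
W_FACTOR = np.array([[30.0, -19.0], [60.0, -19.0]])

def ei_network(lam, shape=(3, 3, 15), frac_inh=0.2, seed=0):
    """Sample one EI weight matrix W[post, pre] on a 3D grid (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    # Grid positions x_i of the neurons.
    axes = [np.arange(s) for s in shape]
    pos = np.stack(np.meshgrid(*axes, indexing="ij"), axis=-1).reshape(-1, 3)
    n = len(pos)
    # Exactly 20 percent inhibitory neurons; their random placement is assumed.
    kind = np.zeros(n, dtype=int)
    kind[rng.choice(n, size=round(frac_inh * n), replace=False)] = 1
    # Euclidean distances d_ij and the assumed decay exp(-(d_ij / lam)^2).
    d = np.linalg.norm(pos[:, None, :] - pos[None, :, :], axis=-1)
    decay = np.zeros_like(d) if lam == 0 else np.exp(-(d / lam) ** 2)
    p_con = C[kind[:, None], kind[None, :]] * decay
    np.fill_diagonal(p_con, 0.0)  # assumption: no self-connections
    connected = rng.random((n, n)) < p_con
    # Uniform weights in [0, 1) times the population-dependent weight factor.
    weights = rng.random((n, n)) * W_FACTOR[kind[:, None], kind[None, :]]
    return connected * weights, kind == 1
```

Calling `ei_network(2.0)` yields a 135 × 135 matrix for the local setting, `ei_network(0.0)` the unconnected case, and `ei_network(np.inf)` the globally coupled case in which only C limits the connection probability.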

In the more detailed LEI network model, the biological layering structure is implemented as well.
The LEI network is composed of three layers (2/3, 4 and 5/6), each of them containing an
excitatory and an inhibitory population. The network again contains 135 neurons, with recurrent
connections within the individual layers and connection probabilities and strengths as in Ref. [9].
As in the biological example, input mostly feeds into layer 4 (input stream 1 in [9]). Layer 2/3
is the hidden layer; the output neurons are confined to layer 5/6. A family of control networks
parametrized by p ∈ [0, 1] is obtained by replacing at each synapse, with probability p, the pre- and
postsynaptic neurons by neurons chosen from the pooled neuronal ensembles of the same kind
(excitatory or inhibitory). This rewiring procedure retains the overall connectivity and weight
distribution between the excitatory and inhibitory populations, but gradually removes the
three-layered structure.
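
A minimal Python sketch of such a rewiring step is given below, under stated assumptions: the weight matrix is indexed as W[post, pre], `inh` is a boolean mask marking the inhibitory neurons, and weights of synapses that happen to land on the same neuron pair are summed. The function name `rewire` and the collision handling are illustrative choices, not taken from the source.

```python
import numpy as np

def rewire(W, inh, p, seed=0):
    """With probability p, move each synapse to a random (post, pre) pair
    drawn from the pooled neurons of the same kinds (E or I)."""
    rng = np.random.default_rng(seed)
    pools = {False: np.flatnonzero(~inh), True: np.flatnonzero(inh)}
    W_new = np.zeros_like(W)
    post, pre = np.nonzero(W)
    for i, j in zip(post, pre):
        w = W[i, j]
        if rng.random() < p:
            # Replace both partners, keeping their excitatory/inhibitory kind.
            i = rng.choice(pools[bool(inh[i])])
            j = rng.choice(pools[bool(inh[j])])
        # Assumption: colliding synapses add up, so the total weight is kept.
        W_new[i, j] += w
    return W_new
```

With p = 0 the network is returned unchanged, and with p = 1 all layer information is lost while the summed weight between each pair of populations is preserved.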

The measurement of the computational effect of inner-columnar wiring structure is done within
the framework of a reservoir computing neural network. While not in all aspects of top-class
efficiency among the possible network types [12], reservoir computing has successfully been used
in robot motion planning [5], despite the linear decision boundaries that it implements. In these
networks, learning is confined to the network's periphery (see Fig. 3 for a conceptual drawing),
which allows us to assess the pure effect of the inner-columnar wiring structure on computation
without it being compromised by the learning process. In the original versions of reservoir computing,
the neurons of the 'reservoir' are randomly connected in a recurrent fashion. Reservoir neurons
receive external input from the signal and recurrent input from other reservoir neurons. In order
to parallel the biological example, we will implement models of spiking neurons, which turns the
network into what is often called a Liquid States network [9, 13-15]. Ref. [9] reported a marked
performance increase when columnar fine-structure was added into Liquid States networks. Their
evidence was, however, extracted from artificial contexts that were not tested for practical
relevance. We will first show that for real-world cognitive tasks, we do not find signatures of such
a computational boost. Then we will zoom out from the columnar to the inter-columnar scale,
which will offer an alternative argument for the existence of cortical columns.

FIG. 3: Reservoir computation model: Stimulus u is associated with output y. Input u is relayed via weight
matrix W_in to the reservoir. In the drawing, the synapses in the reservoir (s) are artificially separated from
neuron membrane potential (v) dynamics and the readout neurons are artificially separated from the rest
of the reservoir. Recurrent reservoir topology is encoded in matrix W via the choices of s. 'Learning' is
confined to matrix W_out.



Reservoir computing is a supervised process to associate k pairs

$$\{u(t)_i,\; y(t)_i\}_{i \in \{1,\dots,k\}}$$

of input / output sequences of individual sequence length T_i (so that t ∈ {1, .., T_i}). The
dimensionality of the input vectors is denoted by N_u, the dimensionality of the output vectors by N_y.
Upon stimulation by the input sequence u(t)_i, the liquid reservoir of N_x neurons generates a state
vector x(t)_i of the same dimension. Let T = Σ_i T_i denote the total time spanned by the input patterns,
and let X be the N_x × T matrix of states. Let Y denote the N_y × T matrix of the associated patterns. The
desired relation W_out x(t)_i ≈ y(t)_i leads directly to the least-squares optimized read-out matrix

$$W_{\mathrm{out}} = Y X^{+},$$

if X^+ is the (Moore-Penrose) pseudo-inverse of X. In typical applications, the system should
respond to a stimulus u(t) by the desired temporal pattern y(t). During the learning phase, the input
and the desired output signal are fed into the reservoir. Care is taken that the scaled input signal
optimally stimulates the respective target neuron models (see below). After a transient phase,
the optimized output matrix W_out is calculated. This step corresponds to the learning process in
classical neural networks. In this framework, network realizations are captured in the connection
weight matrix W, whereas 'learning' in the traditional sense is confined to the read-out matrix
W_out. Excitatory connections are reflected by positive synaptic weights s, inhibitory connections
by negative weights and absence of connections by zero weights.
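
The read-out training step can be illustrated with NumPy. The sketch below assumes the collected reservoir states are stored column-wise in an N_x × T array and the targets in an N_y × T array; the function name `train_readout` is hypothetical and the snippet is not the original implementation.

```python
import numpy as np

def train_readout(X, Y):
    """Least-squares read-out W_out = Y X^+.

    X: (N_x, T) matrix of reservoir states, Y: (N_y, T) matrix of targets.
    Returns W_out with shape (N_y, N_x).
    """
    # np.linalg.pinv computes the Moore-Penrose pseudo-inverse via an SVD.
    return Y @ np.linalg.pinv(X)
```

At recognition time the output for a state vector x is simply `W_out @ x`, so all of the learning effort sits in this single linear solve.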

For all networks, the average neuronal activity is determined by the overall scaling of the input
and the synaptic weights (and in the case of the Izhikevich neurons also by the background current)
and by the wiring matrix W. To ensure that networks of a similar level of neuronal activity are
compared, the matrices W chosen according to the wiring model were scaled to obtain a common
largest eigenvalue (1 for EI networks, 0.2 for LEI networks). Matrix W_in was chosen by drawing
uniformly distributed random numbers from [-0.2, 0.2] (in the case of Izhikevich neurons
(see below) scaled by a factor of 30, to arrive at the standard parameter scale). The input and
synaptic efficiencies were scaled so that neurons could be excited by their presynaptic partners
without reliance on input, but that the firing rates in response to excitation from both input and
presynaptic spikes were also sufficiently distant from saturation (f_sat = 1/τ). This procedure and the
chosen parameters ensured that during all the parameter sweeps we performed, the network was
confined to the same dynamic range.
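
The eigenvalue normalization can be sketched as follows. The snippet assumes that "largest eigenvalue" refers to the largest eigenvalue magnitude (the spectral radius) of W, which is the usual convention in reservoir computing but is not stated explicitly in the text. The function name is hypothetical.

```python
import numpy as np

def scale_to_largest_eigenvalue(W, target):
    """Rescale W so that its largest eigenvalue magnitude equals `target`."""
    rho = np.max(np.abs(np.linalg.eigvals(W)))
    if rho == 0:
        # An unconnected network has nothing to rescale.
        return W.copy()
    return W * (target / rho)
```

Under this reading, an EI wiring matrix would be passed with `target=1.0` and an LEI wiring matrix with `target=0.2`.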

We used two methods of reading out the reservoir. In the original Liquid States network, the
readout is immediate, i.e. memoryless ('ml'). For every input vector an output vector is generated,
allowing "anytime recognition" [13]. For classification tasks it may be advantageous to have a
memory span of a size comparable to the stimulus length. Otherwise the Liquid States network
will confuse stimuli containing similar parts (e.g. phonemes in speech recognition). This is why
we also used an alternative read-out method. For every neuron and stimulus, a stimulus-averaged
firing rate yielded the read-out vector. This method (referred to as integration ('int') readout)
leads to significantly improved recognition rates, although it deviates from the usual Liquid States
network paradigm.

To arrive at statements that are largely independent of the network elements, we tested two
rather distinct models of the neuronal membrane voltage dynamics v(t) in Fig. 3. The leaky
integrate-and-fire neuron dynamics is defined by



s_j is the postsynaptic potential at the synapses innervated by the j'th neuron, which is weighted
by the synaptic efficiency w_ij between presynaptic neuron j and postsynaptic neuron i. u_l denotes
the l'th input component, which is weighted by the connection strength from the l'th input
component to neuron i. We use a membrane time constant τ_m = 30 ms; τ is the integration
time-step. If v_i reaches the threshold v_thr = 1, a spike is triggered, which resets the synapse s_i
to 1 and v_i to v_res = 0. The fast-spiking simple Izhikevich neuron dynamics [16] is given by the
coupled equations

v_i(t + τ) = v_i(t) + τ …

The additional variable r_i controls sub-threshold dynamics and refractoriness. If v_i reaches the
threshold v_thr = 30, a spike is triggered, which resets s_i to 1, v_i to -65 and r_i to r_i + 2. For both
neuron models, the synaptic signal experiences an exponential decay s_i(t + τ) = exp(-τ/τ_syn) s_i(t),
with a constant τ_syn = 2 ms. The axonal delays and the refractory periods are determined by a
fixed integration step of τ = 2 ms. The transcription from the dynamic synapses used in [9] to our
exponential synapses is achieved by setting our synaptic weights equal to the steady state strength
U of [9]. An example of an input signal processed by reservoir networks is given in Fig. 4.

FIG. 4: Arabic Digit MFCC input (color-coded), processed by a LEI neural network with spiking neurons
and by its control network. The spike trains of the two networks are similar because the position and number
of the excitatory and of the inhibitory neurons are maintained.
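
For orientation, a leaky integrate-and-fire reservoir with exponential synapses can be sketched as below. The update rule is an assumption: a standard forward-Euler leaky integrator, v_i(t + τ) = v_i(t) + (τ/τ_m)(-v_i(t) + Σ_j w_ij s_j(t) + Σ_l w_in,il u_l(t)), is used because the exact equation is not legible in the text. Only the constants (τ = 2 ms, τ_m = 30 ms, τ_syn = 2 ms, v_thr = 1, v_res = 0) and the reset and decay rules follow the description above. The function name `simulate_lif` is hypothetical.

```python
import numpy as np

def simulate_lif(W, W_in, U, tau=2.0, tau_m=30.0, tau_syn=2.0,
                 v_thr=1.0, v_res=0.0):
    """Simulate N leaky integrate-and-fire neurons (assumed Euler update).

    W: (N, N) recurrent weights, W_in: (N, N_u) input weights,
    U: (T, N_u) input sequence. Returns a (T, N) boolean spike raster.
    """
    n = W.shape[0]
    v = np.zeros(n)                      # membrane potentials
    s = np.zeros(n)                      # synaptic signals
    decay = np.exp(-tau / tau_syn)       # exponential synapse decay per step
    spikes = np.zeros((len(U), n), dtype=bool)
    for t, u in enumerate(U):
        drive = W @ s + W_in @ u         # recurrent plus external input
        v = v + (tau / tau_m) * (-v + drive)   # assumed leaky integration
        s = decay * s
        fired = v >= v_thr
        v[fired] = v_res                 # reset membrane potential
        s[fired] = 1.0                   # a spike resets the synapse to 1
        spikes[t] = fired
    return spikes
```

Averaging the returned raster over time gives the stimulus-averaged firing rates used by the integration readout, while the instantaneous network state serves the memoryless readout.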

Networks of columns: When zooming out from the columnar dimension to the inter-columnar
scale, we have to wrap up the columnar computation and relate it to the computations performed
by other columns. Rulkov [17, 18] has demonstrated that any desired neuronal firing behavior
representing columnar response can be expressed by a suitably chosen discrete map. The natural
model to use is then a coupled map lattice [7, 8] of chaotic maps, to have the response flexibility
required for communication. In our modeling of bio-inspired connectivity, the probability that two
lattice sites i, j of distance d_ij are connected is

$$p_{ij} = \theta \cdot d_{ij}^{-\alpha} + (1-\theta)\cdot d_{ij}^{-\beta}, \tag{2}$$

which specifies the connectivity matrix, see Fig. 6a. By choosing α, β and θ, a range of network
architectures similar to those explored in the Liquid States network paradigm can be accessed.
Given θ = 1, the system can be changed from a globally coupled network (α = 0) into a
nearest-neighbor coupled network (α → ∞). For 0 < θ < 1, β = 0, α → ∞, the network is coupled to the
nearest neighbor with probability 1 and to all other nodes with probability (1 - θ) up to the cutoff
M. As a result we obtain a combined nearest-neighbor- and random-coupled network. For all
intermediate values of α and β, the network is fractally coupled. The cutoff value M determines,
together with the underlying topology, the average number of connected nodes k. The interaction
of the local chaotic site maps f is modeled by diffusively coupled evolution of the site states

$$x_i(t+1) = (1-\varepsilon)\,f(x_i(t)) + \frac{\varepsilon}{k_i}\sum_{j \in \mathrm{conn}} f(x_j(t)), \tag{3}$$

where t denotes discrete time, k_i the number of connections to the i'th site, indexed by j, and ε the
coupling strength.
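
The connectivity law and the diffusively coupled update can be sketched together in Python. This is an illustrative sketch under several assumptions that are not taken from the source: sites sit on a one-dimensional ring, the cutoff M is ignored, probabilities are clipped to [0, 1], connections are symmetric, and the logistic map stands in for the chaotic site map f. All function names are hypothetical.

```python
import numpy as np

def connection_prob(n, theta, alpha, beta):
    """p_ij = theta * d_ij**-alpha + (1 - theta) * d_ij**-beta on a ring."""
    idx = np.arange(n)
    d = np.abs(idx[:, None] - idx[None, :])
    d = np.minimum(d, n - d).astype(float)   # ring distance (assumed topology)
    p = np.zeros((n, n))
    off = d > 0                              # exclude self-connections
    p[off] = theta * d[off] ** -alpha + (1 - theta) * d[off] ** -beta
    return np.clip(p, 0.0, 1.0)

def sample_adjacency(p, rng):
    """Draw a symmetric boolean adjacency matrix without self-loops."""
    upper = np.triu(rng.random(p.shape) < p, k=1)
    return upper | upper.T

def logistic(x):
    """Fully chaotic logistic map, used here as a stand-in site map f."""
    return 4.0 * x * (1.0 - x)

def cml_step(x, A, eps, f=logistic):
    """One update x_i <- (1 - eps) f(x_i) + (eps / k_i) * sum_j f(x_j)."""
    fx = f(x)
    k = A.sum(axis=1)                        # number of connections per site
    return (1.0 - eps) * fx + eps * (A.astype(float) @ fx) / k
```

With the doubly fractal parameters quoted in the caption of Fig. 6 (θ = 0.2, α = 0.5, β = 2.0), nearest neighbors are connected with probability 1, so every site has at least two connections and the division by k_i is safe.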

The computation performed at this scale will be characterized by the network's ability to
quickly propagate incoming information and by its ability to generate among the affected cells
a coherent state, expressing that 'computation' has emerged [19]. Both abilities depend crucially
on the number of connections k that impinge on a cell. Spatio-temporal information propagation is
measured by the maximal velocity of the propagation of perturbations through the network. As a basic
approximation, this propagation is the result of two independent contributions: The chaotic instability
of the map leads to an average exponential growth of the initial infinitesimal perturbation δ_0
applied at site 0, whereas the diffusive coupling results in a Gaussian spreading. The combined
perturbative effects at site i are then expressed by the equation [20]

$$|\delta x_i(t)| \approx \frac{\delta_0}{\sqrt{4\pi D t}}\,\exp\!\left(\lambda t - \frac{i^2}{4 D t}\right),$$

where D denotes the diffusion coefficient, λ is the Lyapunov exponent of the site map, and δ_0 the
perturbation strength. The velocity v of the traveling wave front is determined at the borderline of
damped and undamped perturbations, i.e. where the exponent vanishes for i = vt, which implies that v
depends on D according to [20]

$$v = 2\sqrt{\lambda D}. \tag{4}$$

For a given site map f, the speed of information transfer ('SIT') is therefore determined by √D
and all that remains to be done is to estimate D from the network via the mean transition time.
This quantity can be determined from the network topology alone, using a Markov chain approach.
Additional network features can be implemented via the connectivity matrix. Implementation of a
detailed columnar structure left our total wiring length results unchanged.
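
The relation between front velocity and diffusion coefficient can be checked numerically. The sketch below assumes the perturbation profile |δx_i(t)| ≈ δ_0 (4πDt)^(-1/2) exp(λt - i²/(4Dt)), locates the site where the amplitude equals the initial strength δ_0, and divides by t. For long times the result should approach 2√(λD). The function name and the parameter values in the usage note are illustrative only.

```python
import numpy as np

def front_velocity(lam, D, t):
    """Front position over time for the assumed Gaussian-times-growth profile.

    The front is the site i where the amplitude equals delta_0, i.e. where
    lam * t - i**2 / (4 * D * t) - 0.5 * log(4 * pi * D * t) = 0.
    """
    arg = lam * t - 0.5 * np.log(4.0 * np.pi * D * t)
    return np.sqrt(4.0 * D * t * arg) / t
```

For example, `front_velocity(0.5, 1.0, 1e6)` lies within 10^-3 of `2 * np.sqrt(0.5 * 1.0)`, and quadrupling D doubles the velocity, which is why estimating D suffices for comparing topologies at a fixed site map.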

Technically, the ability to generate a coherent state is expressed by the cells' ability to
synchronize in a generalized sense. For synchronization, a minimal number k of connections is required
to impinge on a given site. Full dynamical synchronization of the chaotic sites continues to exist
if the condition [21] |e^λ - ε μ_k| < 1 (where λ is the site map Lyapunov exponent and μ_k are the
nonzero eigenvalues of the graph Laplacian) is maintained. This simple criterion may be overly
severe, but it is indicative of what will be found on finer levels of description as well.

Different network architectures should therefore be compared under the constraint of an equal
number of connections k. Biologically relevant indicators for the efficiency of the network will
then be the speed of information transfer through the network and the total wiring length required
for synchronized columns.

RESULTS 

On the columnar scale, where we focus on computation, two popular time series classification
tasks serve as real-world benchmarks, in contrast with the more abstract computations considered
in Ref. [9]. Single Arabic Digit speech recognition [22] is based on time series of 13 Mel
Frequency Cepstral Coefficients for 10 classes of digits spoken by 88 subjects (cf. Fig. 4). Australian
Sign Language (Auslan) recognition is based on time series of 22 parameters for 95 signs, recorded
from a native signer using digital gloves and position tracker equipment [23]. We investigated
the influence of cortical organizational structures on two levels of architectural sophistication: A
simpler excitatory-inhibitory EI network and the more detailed layered excitatory-inhibitory
network topology LEI. We start the discussion of our results from Liquid States networks with two
general observations clear from Fig. 5: Whereas the particular neuron models (and the underlying
circuit parameters) are of secondary influence (blue vs. red curves), the integration readout (right
panels) has a clear advantage over instantaneous readout (left panels). The results obtained for
the EI network demonstrate that the connectivity expressed in terms of λ does not enhance the
computational power of the network. The plot also confirms that local connectivity λ ≈ 2 plays
no distinguishable role among the possible connectivities. Having no recurrent connections at all
among the reservoir neurons does not hamper the recognition rate, suggesting that extremely
little computation is owed to synaptic interaction [25]. One may suspect that the low recognition
rates from memoryless readout arise because in the classical Liquid States network paradigm the
input signal is applied to all neurons, which constantly overwrites memory that otherwise would
be retained in 'hidden' neurons. To exclude this possibility, we examined in the second row of Fig. 5
the role of the hidden neurons, by measuring R for networks having a fraction f of input signal
receiving neurons. The desired value of f is achieved by removing the input signals from a
corresponding number of reservoir neurons. To compare with Refs. [13, 24], the connectivity was set to
local next neighbors (λ = 2), except for one test using Izhikevich neurons with λ = 0 (ocher line).
If hidden neurons were beneficial, we should, again, perceive a maximum of R for some optimal
value of f. In the Arabic Digit task with memoryless readout we do not observe a dependence on
the number of actually used neurons (i.e. beyond f = 0.1, where we have on average 13.5 input
receiving neurons, at an input dimensionality of 13). The similarity of the results obtained for
λ = 0 and for λ = 2 suggests that the nonlinear interaction among the input receiving neurons does
not significantly enhance performance. In the Auslan task we see a monotonic dependence of R
on f, because for most values of f the number of input receiving neurons is smaller than the input
dimensionality (i.e., we have f · 135 < 95). As a function of λ, in the biological setting Izhikevich
neurons tend to globally phase-lock to the strong inner excitation, which leads to a somewhat
reduced input-responsiveness. The details of why the effect emerges exactly in the biological setting
are not clear. The EI network with biology-motivated wiring structure thus does not perform
significantly better than the control network. The results obtained for the LEI networks, which reflect
the columnar layering structures in more detail (see Fig. 5 II), corroborate the observations made for
the simpler model: A significant dependence of R on the rewiring probability p was not observed.
These observations are compatible with earlier findings for echo state networks.

FIG. 5: Recognition rate R for a) Arabic Digit, b) Auslan Sign recognition, using leaky integrate-and-fire
(blue curves) or Izhikevich (red curves) neurons in the networks. Each data point is the average over 20
realizations. Left column: memoryless ('ml'), right column: integration ('int') readout. Networks (cf.
Fig. 2): I) EI network, recognition rate dependence on connectivity λ (control networks: dashed curves),
and on the ratio f of input receiving neurons at local connectivity λ = 2. Ocher: Izhikevich neurons with
λ = 0 (no connections). II) LEI networks, recognition rate dependence on the rewiring probability p. p = 0:
layered, p = 1: homogeneous control network.

On the mesoscopic scale, instead of the recognition rate we focus on the information transport
across the cortical network, and on the material (i.e., the network's total wiring length) needed
for obtaining a spatio-temporally coherent computation. To demonstrate the beneficial effects of
a columnar organization of the network, a coupled map lattice [7, 8] is now a more appropriate
network model than the Liquid States network (see last section). In the coupled map paradigm,
neuronal interaction is modeled by chaotic site maps communicating by way of diffusive coupling
(Eq. 3). When we measured the speed of information transport through the network for doubly
fractal, single fractal, random, and nearest-neighbor coupling topologies, we indeed found a
strong dependence on the wiring topology. For the doubly fractal architecture suggested by the
data of Roerig et al. we find a consistently enhanced speed of information transfer compared
to the alternative networks (Fig. 6b, upper panel). In this figure we plot the speed of information
transfer in arbitrary units that scale with the square root of the positive Lyapunov exponent
characterizing the chaotic site maps (Eq. 4). Speed enhancement persists across a wide selection of pairs
of exponents as long as the qualitative size of the exponents is preserved [26]. Our numerical
experiments show that under the condition of synchronization, the doubly fractal architecture jointly
optimizes total wiring length TWL and speed (Fig. 6b, lower panel).

FIG. 6: a) Main network densities compared (p: connection probabilities, d: distance, M: cutoff, see text).
Below: examples of 'fractal' wirings. b) Speed of information transfer 'SIT' as a function of the average
number of connections k established by Eq. 2. From top: doubly fractal (θ = 0.2, α = 0.5, β = 2.0),
fractal (θ = 1, α = 0.7), random, next-neighbor topology. Network sizes: N = 4096. Data points are
averages over 100 realizations. Lower panel: Typical number of connections k required for synchronization
(numbers) and corresponding relative total wiring length TWL (histogram height). N = 512. Average over
10 realizations.



DISCUSSION 



Our inner-columnar wiring computational experiments focused on time series classification
problems, because this task class seems to be typical for everyday work performed by the
mammalian cortex. Across a wide variety of network realizations, we found no evidence of a
micro-architectural performance boost. A significant dependence of R on the rewiring probability p is
not detectable in our experiments. Layer segregation and special inter-layer connectivities do not
lead to improvements over fully random circuits. The recognition enhancement observed with the
integration readout is an effect of input/output neuron sparseness. Obviously, much more work
on understanding the dynamics and computations in recurrent neural networks needs to be done if
we would like to argue that observed detailed biological wiring schemes facilitate efficient
computation. At the present stage, it rather seems that exact connection statistics play no eminent
computational role. Instead, their role could be to keep the neurons in an overall dynamic regime,
allowing them to be maximally input-sensitive. Computational benefits could still be located in
the neuronal fine structures. This level is necessarily disregarded when modeling generic columns
from measurements across a large number of cortical sites, subjects or even across species. Our
focus on given biological data, and the particular comparisons we decided to deliver, did not allow
us to fully exploit the power of Liquid States networks: Higher recognition rates could be achieved
with a larger number of neurons and with different time constants. In particular, in practical
applications of Liquid States networks, the amount of computational resources occupied by the synaptic
interactions would more wisely be invested in an increased neuronal population. This would also
have the advantage of rendering these networks parallelizable in a simpler way. These constraints
do not, however, compromise the central finding made in this part of the work: The absence of the
claimed computational boost by the biologically motivated wiring schemes.

To a real-world neural network, propagation speed and computation across the mesoscopic
inter-columnar scale are of equal importance with local computation. At this scale, the network is
consistently modeled as a coupled map lattice, with its columnar outputs captured by chaotic maps
communicating via diffusive coupling. We found that the few long-ranged connections present in
our doubly fractal model substantially enhance the speed of information transfer beyond that
provided by competing network topologies. Thus, the columnar structure may have emerged as a
facilitating structure for speed of information transfer optimization, irrespective of the particular
values of the exponents α, β characterizing the connectivity decay. Moreover, and more
importantly, doubly fractal networks synchronize at a shorter total wiring length TWL and, at this
condition, keep their superior speed of information transfer SIT. The doubly fractal networks achieve
this performance irrespective of whether the long/short ranges are implemented by neurons with
long and short connections, or by neurons with predominantly long connections and neurons with
predominantly short connections, and whether they are organized in a columnar structure or not.
From this perspective, the columnar structures may express a sufficient (but not necessary)
facilitating structure of a combined speed of information / minimal wiring length optimization. In fact,
whereas most monkeys, carnivores and ungulates do have columns, rats and mice do not have them
(discounting the dedicated structure of the barrel cortex). The characteristics of the optimal
networks (mean path lengths and clustering coefficients) emerging in our study are consistently close
to those of the biological examples (e.g., C. elegans [27]). Moreover, our excitatory-inhibitory
power law decay exponents of the network connectivity (2 and 1.5, respectively) are close to those
obtained from a critical avalanche model of cortical computation [28] for avalanche time duration
and avalanche size, respectively. The question emerges whether there is a direct link between our
approach and models of cortical computation at criticality [29, 30]. Detailed numerical
experiments along our framework may reveal the nature of this correspondence.

Since there is no indication of a phase-transition at a finite system size, we expect that the 
building of large-scale neural networks based on simple electronic neurons arranged according to 
the physiological template will corroborate the two main cortical network features extracted above: 
An increased speed of information transfer and synchronizability at minimal wiring length. 